51 research outputs found
Optimal Clustering under Uncertainty
Classical clustering algorithms typically either lack an underlying
probability framework to make them predictive or focus on parameter estimation
rather than defining and minimizing a notion of error. Recent work addresses
these issues by developing a probabilistic framework based on the theory of
random labeled point processes and characterizing a Bayes clusterer that
minimizes the number of misclustered points. The Bayes clusterer is analogous
to the Bayes classifier. Whereas determining a Bayes classifier requires full
knowledge of the feature-label distribution, deriving a Bayes clusterer
requires full knowledge of the point process. When uncertain of the point
process, one would like to find a robust clusterer that is optimal over the
uncertainty, just as one may find optimal robust classifiers with uncertain
feature-label distributions. Herein, we derive an optimal robust clusterer by
first finding an effective random point process that incorporates all
randomness within its own probabilistic structure and from which a Bayes
clusterer can be derived that provides an optimal robust clusterer relative to
the uncertainty. This is analogous to the use of effective class-conditional
distributions in robust classification. After evaluating the performance of
robust clusterers in synthetic mixtures of Gaussians models, we apply the
framework to granular imaging, where we make use of the asymptotic
granulometric moment theory for granular images to relate robust clustering
theory to the application. Comment: 19 pages, 5 eps figures, 1 table.
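The effective-process idea can be illustrated with a minimal sketch. Assuming uncertainty over just two candidate pairs of one-dimensional Gaussian component means with known prior weights (all numeric values below are hypothetical, and this is a toy illustration of the effective class-conditional construction, not the paper's algorithm), the robust clusterer applies a Bayes labeling rule to the model-averaged "effective" densities:

```python
import numpy as np

# Uncertain point process: two candidate models for the cluster means,
# each a pair of unit-variance 1-D Gaussian components (hypothetical values).
candidate_means = [(-2.0, 2.0), (-1.0, 3.0)]
model_weights = [0.5, 0.5]  # prior weights over the candidate models

def effective_density(x, label):
    """Effective class-conditional density: the candidate densities
    averaged over the model uncertainty (the 'effective' process)."""
    return sum(
        w * np.exp(-0.5 * (x - means[label]) ** 2) / np.sqrt(2 * np.pi)
        for w, means in zip(model_weights, candidate_means)
    )

def robust_cluster(points):
    """Assign each point the label with the larger effective posterior,
    i.e. the Bayes clusterer of the effective process (equal label
    priors assumed, label-switching ignored in this toy setting)."""
    return [0 if effective_density(x, 0) >= effective_density(x, 1) else 1
            for x in points]

print(robust_cluster([-2.5, -1.8, 2.2, 3.1]))  # → [0, 0, 1, 1]
```

Averaging the candidate densities before applying the Bayes rule is what makes the rule robust to which candidate model actually generated the data.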
Prospectus, September 24, 1986
Prevalent, protective, and convergent IgG recognition of SARS-CoV-2 non-RBD spike epitopes
The molecular composition and binding epitopes of the immunoglobulin G (IgG) antibodies that circulate in blood plasma following SARS-CoV-2 infection are unknown. Proteomic deconvolution of the IgG repertoire to the spike glycoprotein in convalescent subjects revealed that the response is directed predominantly (>80%) against epitopes residing outside the receptor-binding domain (RBD). In one subject, just four IgG lineages accounted for 93.5% of the response, including an N-terminal domain (NTD)-directed antibody that was protective against lethal viral challenge. Genetic, structural, and functional characterization of a multi-donor class of “public” antibodies revealed an NTD epitope that is recurrently mutated among emerging SARS-CoV-2 variants of concern. These data show that “public” NTD-directed and other non-RBD plasma antibodies are prevalent and have implications for SARS-CoV-2 protection and antibody escape.
Heuristic algorithms for feature selection under Bayesian models with block-diagonal covariance structure
Background: Many bioinformatics studies aim to identify markers, or features, that can be used to discriminate between distinct groups. In problems where strong individual markers are not available, or where interactions between gene products are of primary interest, it may be necessary to consider combinations of features as a marker family. To this end, recent work proposes a hierarchical Bayesian framework for feature selection that places a prior on the set of features we wish to select and on the label-conditioned feature distribution. While an analytical posterior under Gaussian models with block covariance structures is available, the optimal feature selection algorithm for this model remains intractable, since it requires evaluating the posterior over the space of all possible covariance block structures and feature-block assignments. To address this computational barrier, in prior work we proposed a simple suboptimal algorithm, 2MNC-Robust, with robust performance across the space of block structures. Here, we present three new heuristic feature selection algorithms. Results: The proposed algorithms outperform 2MNC-Robust and many other popular feature selection algorithms on synthetic data. In addition, enrichment analysis on real breast cancer, colon cancer, and leukemia data indicates they also output many of the genes and pathways linked to the cancers under study. Conclusions: Bayesian feature selection is a promising framework for small-sample high-dimensional data, in particular biomarker discovery applications. When applied to cancer data, these algorithms output many genes already shown to be involved in cancer as well as potentially new biomarkers. Furthermore, one of the proposed algorithms, SPM, outputs blocks of heavily correlated genes, which is particularly useful for studying gene interactions and gene networks.
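The block-of-correlated-features idea mentioned in the Conclusions can be loosely illustrated. The sketch below greedily groups features whose pairwise correlation exceeds a threshold into blocks; the grouping scheme and threshold are hypothetical stand-ins for illustration, not the paper's SPM algorithm or its Bayesian posterior:

```python
import numpy as np

def correlated_blocks(X, threshold=0.8):
    """Group the columns of X into blocks of mutually correlated
    features via greedy single-linkage on |correlation| > threshold.
    A toy stand-in for a block-diagonal covariance structure."""
    corr = np.abs(np.corrcoef(X, rowvar=False))
    unassigned = set(range(X.shape[1]))
    blocks = []
    while unassigned:
        seed = min(unassigned)
        block, frontier = {seed}, {seed}
        while frontier:
            f = frontier.pop()
            linked = {j for j in unassigned - block if corr[f, j] > threshold}
            block |= linked
            frontier |= linked
        unassigned -= block
        blocks.append(sorted(block))
    return blocks

# Synthetic data: three noisy copies of one signal plus one
# independent feature, so features 0-2 should form a single block.
rng = np.random.default_rng(1)
base = rng.normal(size=200)
X = np.column_stack([
    base,
    base + 0.1 * rng.normal(size=200),
    base + 0.1 * rng.normal(size=200),
    rng.normal(size=200),
])
print(correlated_blocks(X))  # → [[0, 1, 2], [3]]
```

Selecting whole blocks rather than single columns is what lets a block-structured method surface families of co-regulated genes instead of isolated markers.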